Filters
Upload date
Any date
Last Hour
Today
This week
This month
This year
Duration
Any duration
Short (< 4 minutes)
Long (> 20 minutes)
Medium (4 - 20 minutes)
Sort By
Relevance
Rating
Upload Date
View count
Apply!
Videos
Web
34:32
Mixtral of Experts (Paper Explained)
54K views
•
4 months ago
Yannic Kilcher
Download
01:05:44
Stanford CS25: V1 I Mixture of Experts (MoE) paradigm and the Switch Transformer
26K views
•
1 year ago
Stanford Online
Download
33:47
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
31K views
•
3 years ago
Yannic Kilcher
Download
22:58
Efficient Large-Scale Language Model Training on GPU Clusters
4.4K views
•
2 years ago
Databricks
Download
11:31
Interquery Parallelism and Intraquery Parallelism in Query Processing
28K views
•
4 years ago
WIT Solapur - Professional Learning Community
Download
01:04:32
Stanford CS25: V4 I Demystifying Mixtral of Experts
4.2K views
•
3 weeks ago
Stanford Online
Download
22:54
Mixture of Experts LLM - MoE explained in simple terms
12K views
•
5 months ago
code_your_own_AI
Download
05:29
PARALLEL STRUCTURE | English Lesson
240K views
•
4 years ago
Kevin Spaans
Download
01:26:21
Mistral / Mixtral Explained: Sliding Window Attention, Sparse Mixture of Experts, Rolling Buffer
22K views
•
5 months ago
Umar Jamil
Download
55:07
Alpa: Automated Model-Parallel Deep Learning - Zhuohan Li | Stanford MLSys #59
5.6K views
•
2 years ago
Stanford MLSys Seminars
Download
58:23
Sparse Expert Models (Switch Transformers, GLAM, and more... w/ the Authors)
18K views
•
2 years ago
Yannic Kilcher
Download
30:46
CppCon 2018: Thomas Rodgers “Bringing C++ 17 Parallel Algorithms to a standard library near you”
3.8K views
•
5 years ago
CppCon
Download
08:38
Transformers: The best idea in AI | Andrej Karpathy and Lex Fridman
364K views
•
1 year ago
Lex Clips
Download
34:11
Aaron Richter- Parallel Processing in Python| PyData Global 2020
13K views
•
3 years ago
PyData
Download
56:39
Alpa: Automating Inter- and Intra- Operator Parallelism for Distributed Deep Learning
1.7K views
•
2 years ago
uwsampl
Download
58:57
AWS re:Invent 2022 - AI parallelism: How Amazon Search scales deep-learning training (CMP209)
612 views
•
1 year ago
AWS Events
Download
56:30
The TStreams Model: A new approach to parallel programming
30 views
•
7 years ago
Microsoft Research
Download
54:29
AI Engine Architecture: Data Movement, Synchronization, Reconfiguration & Application Mapping
2.3K views
•
1 year ago
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
Download
01:11:36
Microsoft DeepSpeed introduction at KAUST
5.7K views
•
1 year ago
KAUST Supercomputing Laboratory
Download
08:46
How to Ski | 7 Steps to Parallel Turns
2.2M views
•
5 years ago
Stomp It Tutorials
Download
Next Page